Window Document (Text Processing)
Synopsis
Windows a document and returns a collection of the windows.Description
This operator moves a sliding window over the tokens of a document and returns a collection containing a new document for each window. The size of the sliding window may be adapted as well as the step size the window is moved in each step.
Input
- document
The document port.
Output
- documents (Collection)
The documents port.
Parameters
- window lengthDefines the number of tokens a window covers. The resulting document will contain a token sequence of that length.
- step sizeDefines the number of tokens between the start of two windows. A step size of one would case each token to become first token of a window.
- extend last windowIf checked, the last window will be extended, so that it covers all remaining tokens. Otherwise incomplete windows will be added.
- parallelize segment processingDetermines whether the execution of Segment Processing should be parallelized.